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PLANT FATTY ACXD SYNTHASES AND USE IN IMPROVED METHODS FOR 
PRODUCTION OF MEDIUM- CHAIN FATTY ACIDS 

INTRODUCTION 

Field of Inventipn 

The present invention is directed to genes encoding 
plant fatty acid synthase enzymes relevant to fatty acid 
synthesis in plants, and to methods of using such genes in 
combination with genes encoding plant medium-chain 
preferring thioesterase proteins. Such uses provide a 
method to increase the levels of medium-chain fatty acids 

0 

that may be produced in seed oils of transgenic plants. 
Background 

Higher plants synthesize fatty acids via a common 
metabolic pathway. In developing seeds, where fatty acids 
attached to triglycerides are stored as a source of energy 
for further germination, the fatty acid synthesis pathway is 
located in the plastids . The first step is the formation of 
acetyl-ACP (acyl carrier protein) from acetyl -CoA and ACP 
catalyzed by a short chain preferring condensing enzyme, £- 
ketoacyl-ACP synthase (KAS) III. Elongation of acetyl-ACP 
to 16- and 18- carbon fatty acids involves the cyclical 
action of the following sequence of reactions: condensation 
with a two-carbon unit from malonyl-ACP to form a longer £- 
ketoacyl-ACP (£-ketoacyl-ACP synthase) , reduction of the 



WO 98/46776 2 PCT/US98/07114 

keto- function to an alcohol (£-ketoacyl-ACP reductase) , 
dehydration to form an enoyl-ACP (S-hydroxyacyl-ACP 
dehydrase) , and finally reduction of the enoyl-ACP to form 
the elongated saturated acyl-ACP (enoyl-ACP reductase) . 6- 
5 ketoacyl-ACP synthase I (KAS I) , is primarily responsible 
for elongation up to palmitoyl-ACP (C16:0), whereas 6- 
ketoacyl-ACP synthase II (KAS II) is predominantly 
responsible for the final elongation to stearoyl-ACP 
(C18:0) . 

10 Genes encoding peptide components of £-ketoacyl -ACP 

synthases I and II have been cloned from a number of higher 
plant species, including castor (Ricinus communis) and 
Brassica species (USPN 5,510,255). KAS I activity was 
associated with a single synthase protein factor having an 

15 approximate molecular weight of 50 kD (synthase factor B) 

and KAS II activity was associated with a combination of two 
synthase protein factors, the 50 kD synthase factor B and a 
46 kd protein designated synthase factor A. Cloning and 
sequence of a plant gene encoding a KAS III protein has been 

20 reported by Tai and Jaworski (Plant Physiol. (1993) 
103: 1361-1367) . 

The end products of plant fatty acid synthetase 
activities are usually 16- and 18-carbon fatty acids. There 
are, however, several plant families that store large 

25 amounts of 8- to 14-carbon (medium-chain) fatty acids in 
their oilseeds. Recent studies with Umbellularia 
calif ornica (California bay) , a plant that produces seed oil 
rich in lauric acid (12:0), have demonstrated the existence 
of a medium-chain-specific isozyme of acyl-ACP thioesterase 
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in the seed plastids. Subsequent purification of the 12:0- 
. ACP thioesterase from Umbel lul ari a californica led to the 
cloning of a thioesterase cDNA which was expressed in seeds 
of Arabidopsis and Brassica resulting in a substantial 
accumulation of lauric acid in the triglyceride pools of 
these transgenic seeds (USPN 5,512,482). These results and 
subsequent studies with medium-chain thioes terases from 
other plant species have confirmed the chain-length- 
determining role of acyl-ACP thioesterases during de novo 
fatty acid biosynthesis (T. Voelker (1996) Genetic 
Engineering, Ed. J. K. Setlow, Vol. 18, pgs . 111-133). 

DESCRIPTION OF THE FIGURES 

Figure 1. DNA and translated amino acid sequence of Cuphea 
hookeriana KAS factor B clone chKAS B-2 are provided. 
Figure 2. DNA and translated amino acid sequence of Cuphea 
hookeriana KAS factor B clone chKAS B-31-7 are provided. 
Figure 3 . DNA and translated amino acid sequence of Cuphea 
hookeriana KAS factor A clone chKAS A-2-7 are provided. 
Figure 4. DNA and translated amino acid sequence of Cuphea 
hookeriana KAS factor A clone chKAS A- 1-6 are provided. 
Figure 5. DNA and translated amino acid sequence of Cuphea 
pullcherrima KAS factor B clone cpuKAS B/7-8 are provided. 
Figure 6 . DNA and translated amino acid sequence of Cuphea 
pullcherrima KAS factor B clone cpuKAS B/8-7A are provided. 
Figure 7. DNA and translated amino acid sequence of Cuphea 
pullcherrima KAS factor A clone cpuKAS A/p7-6A are provided. 
Figure 8. Preliminary DNA sequence of Cuphea pullcherrima 
KAS factor A clone cpuKAS A/p8-9A is provided. 
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Figure 9. DNA and translated amino acid sequence of Cuphea 
hookeriana KASIII clone chKASIII-27 are provided. 
Figure 10. The activity profile for purified cpuKAS B/8-7A 
using various acyl-ACP substrates is provided. 
5 Figure 11. The activity profile for purified chKAS A-2-7 
and chKAS A-l-6 using various acyl-ACP substrates is 
provided . 

Figure 12. The activity profile for purified castor KAS 
factor A using various acyl-ACP substrates is provided. 

10 Figure 13 . The activity profile for purified castor KAS 
factor B using various acyl-ACP substrates is provided. 
Figure 14 . A graph showing the number of plants arranged 
according to C8 : 0 content for transgenic plants containing 
CpFatBl versus transgenic plants containing CpFatBl + chKAS 

15 A-2-7 is provided. 

Figure 15. Graphs showing the %C10/%C8 ratios in transgenic 
plants containing ChFatB2 (4804-22-3 57) and in plants 
resulting from crosses between 4804-22-357 and 5401-9 (chKAS 
A-2-7 plants) are provided. 

20 Figure 16 . Graphs showing the %C10 + %C8 contents in 

transgenic plants containing ChFatB2 (4804-22-357) and in 
plants resulting from crosses between 4804-22-357 and 5401-9 
(chKAS A-2-7 plants) are provided. 

Figure 17. Graphs showing the %C10/%C8 ratios in transgenic 
25 plants containing ChFatB2 (4804-22-357) and in plants 

resulting from crosses between 4804-22-357 and 5413-17 (chKAS 
A-2-7 + CpFatBl plants) are provided. 

Figure 18 . Graphs showing the %C10 + %C8 contents in 
transgenic plants containing ChFatB2 (4804-22-357) and in 
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plants resulting from crosses between 4804-22-357 and 5413-17 
(chKAS A-2-7 + CpFatBl plants) are provided. 

Figure 19. Graphs showing the %C12.:0 in transgenic plants 
containing Uc FatBl (LA86DH186) and in plants resulting from 
5 crosses with wild type (X WT) and with lines expressing Ch 
KAS A-2-7 . 

Figure 20. Graph showing the relative proportions of C12:0 
and C14:0 fatty acids in the seeds of transgenic plants 
containing Uc FatBl (LA86DH186) and in plants resulting from 
10 crosses with wild type (X WT) and with lines expressing Ch 
KAS A-2-7 . 

Figure 21. Graphs showing the %C18:0 in transgenic plants 
containing Garm FatBl (5266) and in seeds of plants resulting 
from crosses with wild type (X WT) and with lines expressing 
15 Ch KAS A-2-7 . 

Figure 22. The activity profile of Ch KAS A in protein 
extracts from transgenic plants containing Ch KAS A-2-7. 
Extracts were preptreated with the indicated concentrations 
of cerulenin. 

20 

SUMMARY OF THE INVENTION 

By this invention, compositions and methods of use 
related to £-ketoacyl-ACP synthase (KAS) are provided. Also 
of interest are methods and compositions of amino acid and 
25 nucleic acid sequences related to biologically active plant 
synthase (s) . 

In particular, genes encoding KAS protein factors A and 
B from Cuphea species are provided. The KAS genes are of 
interest for use in a variety of applications, and may be 
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used to provide synthase I and/or synthase II activities in 
'transformed host cells, including bacterial cells, such as 
E. coli, and plant cells. Synthase activities are 
distinguished by the preferential activity towards longer 
5 and shorter acyl-ACPs as well as by the sensitivity towards 
! the KAS specific inhibitor, cerulenin. Synthase protein 
\ preparations having preferential activity towards medium 
chain length acyl-ACPs are synthase I-type or KAS I. The 
KAS I class is sensitive to inhibition by cerulenin at 
10 concentrations as low as 1}IM. Synthases having preferential 
activity towards longer chain length acyl-ACPs are synthase 
II -type or KAS II. The KAS enzymes of the II-type are also 
sensitive to cerulenin, but at higher concentrations (50|0M) . 
Synthase Ill-type enzymes have preferential activity towards 
15 short chain length acyl-ACPs and are insensitive to 
cerulenin inhibition . 

Nucleic acid sequences encoding a synthase protein may 
be employed in nucleic acid constructs to modulate the 
amount of synthase activity present in the host cell, 
20 especially the relative amounts of synthase I -type, synthase 
II-type and synthase Ill-type activity when the host cell is 
a plant host cell. In addition, nucleic acid constructs may 
be designed to decrease expression of endogenous synthase in 
a plant cell as well. One example is the use of an anti- 
25 sense synthase sequence under the control of a promoter 

capable of expression in at least those plant cells which 
normally produce the enzyme. 

Of particular interest in the present invention is the 
coordinate expression of a synthase protein with the 
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expression of thioesterase proteins. For example, 
coordinated expression of synthase factor A and a medium- 
chain thioesterase provides a method for increasing the 
level of medium-chain fatty acids which may be harvested 
5 from transgenic plant seeds. Furthermore, coordinated 

expression of a synthase factor A gene with plant medium- 
chain thioesterase proteins also provides a method by which 
the ratios of various medium-chain fatty acids produced in a 
transgenic plant may be modified. For example, by 
10 expression of a synthase factor A, it is possible to 

increase the ratio of C10/C8 fatty acids which are produced 
in plant seed oils as the result of ! expression of a 
thioesterase having activity on C8 and CIO fatty acids. 

15 DETAILED DESCRIPTION OF THE INVENTION 

A plant synthase factor protein of this invention 
includes a sequence of amino acids or polypeptide which is 
required for catalyzation of a condensation reaction between 
an acyl-ACP having a chain length of C2 to Ci6 and malonyl- 

20 ACP in a plant host cell. A particular plant synthase 
factor protein may be capable of catalyzing a synthase 
reaction in a plant host cell (for example as a monomer or 
homodimer) or may be one component of a multiple peptide 
enzyme which is capable of catalyzing a synthase reaction in 

25 a plant host cell, i.e. one peptide of a heterodimer . 

Synthase I (KAS I) demonstrates preferential activity 
towards acyl-ACPs having shorter carbon chains, C2-C14 and 

is sensitive to inhibition by cerulenin at concentrations of 
1|1M. Synthase II (KAS II) demonstrates preferential 
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activity towards acyl-ACPs having longer carbon chains, C3.4- 
C3.6, and is inhibited by concentrations of cerulenin (50fiJyi) - 
Synthase- III demonstrates preferential activity towards 
acyl-CoAs having very short carbon chains, C2 to C6, and is 
5 insensitive to inhibition by cerulenin- 
! Synthase factors A and B, and synthase III proteins 

\ obtained from medium- chain fatty acid producing plant 
species of the genus Cuphea are described herein. As 
described in the following Examples, synthase A from C. 

10 hookeriana is naturally expressed at a high level and only 
in the seeds. C. hookeriana synthase B is expressed at low 
levels in all tissues examined. Expression of synthase A 
and synthase B factors in E. coli and purification of the 
resulting proteins is employed to determine activity of the 

15 various synthase factors. Results of these analyses 

indicate that synthase factor A from Cuphea hookeriana has 
the greatest activity on 6:0-ACP substrates, whereas 
synthase factor B from Cuphea pullcherrima has greatest 
activity on 14:0-ACP. Similar studies with synthase factors 

20 A and B from castor demonstrate similar activity profiles 
between the factor B synthase proteins from Cuphea and 
castor. The synthase A clone from castor, however, 
demonstrates a preference for 14:0-ACP substrate. 

Expression of a Cuphea hookeriana KAS A protein in 

25 transgenic plant seeds which normally do not produce medium- 
chain fatty acids does not result in any detectable 
modification of the fatty acid types and contents produced 
in such seeds. However, when Cuphea hookeriana KAS A 
protein is expressed in conjunction with expression of a 
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medium-chain acyl-ACP thioesterase capable of providing for 
production of C8 and CIO fatty acids in plant seed oils, 
increases in the levels of medium-chain fatty acids over the. 
levels obtainable by expression of the medium-chain 
thioesterase alone are observed. In addition, where 
significant amounts of C8 and CIO fatty acids are produced 
as the result of medium-chain thioesterase expression, co- 
expression of a Cuphea KAS A protein also results in an 
alteration of the proportion of the C8 and CIO fatty acids 
that are obtained. For example, an increased proportion of 
CIO fatty acids may be obtained by co-expression of Cuphea 
hookeriana ChFatB2 thioesterase and a chKAS A synthase 
factor proteins. 

Furthermore, when Cuphea hookeriana KAS A protein is 
expressed in conjunction with expression of a medium-chain 
acyl-ACP thioesterase capable of providing for production of 
C12 fatty acids in plant seed oils, increases in the levels 
of medium-chain fatty acids over the levels obtainable by 
expression of the medium-chain thioesterase alone are also 
observed. In addition, where significant amounts of C12 and 
C14 fatty acids are produced as the result of medium-chain 
thioesterase expression, co-expression of a Cuphea KAS A 
protein also results in an alteration of the proportion of 
the C12 and C14 fatty acids that are obtained. For example, 
an increased proportion of C12 fatty acids may be obtained 
by co-expression of Uc FatBl thioesterase and a chKAS A 
synthase factor proteins . 

However, when Cuphea hookeriana KAS A protein is 
expressed in conjunction with the expression of a long-chain 
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acyl-ACP thioesterase capable of providing for production of 
C18 and C18:l fatty acids in plant seed oils, no effect on 
the production of long chain fatty acids was observed. 
Furthermore, when plants transformed to express a long chain 
acyl-ACP thioesterase from mangos teen (Gar/nFatAl, U.S. 
Patent Application No. 08/440,845), which preferentially 
hydrolyzes C18:0 and C18:l fatty acyl-ACPs, are crossed with 
nontrans formed control plants, a significant reduction in 
the levels of C18:0 is obtained. Similar reductions are also 
observed in the levels of C18:0 in the seeds of plants 
resulting from crosses between plants transformed to express 
the GarmFatAl and plants expressing the Cuphea hookeriana 
KAS A protein. 

Thus, the instant invention provides methods of 
increasing and/or altering the medium-chain fatty acid 
compositions in transgenic plant seed oils by co-expression 
of medium-chain acyl-ACP thioes terases with synthase factor 
proteins. Furthermore, various combinations of synthase 
factors and medium-chain thioesterases may be achieved 
depending upon the particular fatty acids desired. For 
example, for increased production of C14 fatty acids, 
synthase protein factors may be expressed in combination 
with a C14 thioesterase, for example from Cuphea palustris 
or nutmeg may be employed (WO 96/23892) . In addition, 
thioesterase expression may be combined with a number of 
different synthase factor proteins for additional effects on 
medium-chain fatty acid composition. 

Synthases of use in the present invention include 
modified amino acid sequences, such as sequences which have 
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been mutated, truncated, increased and the like, as well as 
such sequences which are partially or wholly artificially 
synthesized. The synthase protein encoding sequences 
provided herein may be employed in probes for further 
5 screening or used in genetic engineering constructs for 
transcription or transcription and translation in host 
cells, especially plant host cells. One skilled in the art 
will readily recognize that antibody preparations, nucleic 
acid probes (DNA and RNA) and the like may be prepared and 

10 used to screen and recover synthases and/or synthase nucleic 
acid sequences from other sources. Typically, a 
homologously related nucleic acid sequence will show at 
least about 60% homology, and more preferably at least about 
70% homology, between the -R . communis synthase' and the given 

15 plant synthase of interest, excluding any deletions which 
may be present. Homology is determined upon comparison of 
sequence information, nucleic acid or amino acid, or through 
hybridization reactions . 

Recombinant constructs containing a nucleic acid 

20 sequence encoding a synthase protein factor or nucleic acid 
sequences encoding a synthase protein factor and a medium- 
chain acyl-ACP thioesterase may be prepared by methods well 
known in the art. Constructs may be designed to produce 
synthase in either prokaryotic or eukaryotic cells. The 

25 increased expression of a synthase in a plant cell, 

particularly in conjunction with expression of medium-chain 
thioesterases , or decreasing the amount of endogenous 
synthase observed in plant cells are of special interest. 
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Synthase protein factors may be used, alone or in 
combination, to catalyze the elongating condensation 
reactions of fatty acid synthesis depending upon the desired 
result. For example, rate influencing synthase activity may 
5 reside in synthase I-type, synthase Il-type, synthase III- 
\ type or in a combination of these enzymes. Furthermore, 
\ synthase activities may rely on a combination of the various 
synthase factors described herein. 

Constructs which contain elements to provide the 
10 transcription and translation of a nucleic acid sequence of 
interest in a host cell are "expression cassettes". 
Depending upon the host, the regulatory regions will vary, 
including regions from structural genes from viruses, 
plasmid or chromosomal genes, or the like. For expression 
15 in prokaryotic or eukaryotic microorganisms, particularly 
unicellular hosts, a wide variety of constitutive or 
regulatable promoters may be employed. Among 

transcriptional initiation regions which have been described 
are regions from bacterial and yeast hosts, such as E. coli, 
20 B . subtilis, Saccharomyces cerevisiae, including genes such 
as £-galactosidase, T7 polymerase, trp-lac (tac) , trp E and 
the like. 

An expression cassette for expression of synthase in a 
plant cell will include, in the 5' to 3 * direction of 
25 transcription, a transcription and translation initiation 
control regulatory region {also known as a "promoter") 
functional in a plant cell, a nucleic acid sequence encoding 
a synthase, and a transcription termination region. 
Numerous transcription initiation regions are available 
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which provide for a wide variety of constitutive or 
regulatable, e.g., inducible, transcription of the 
desaturase structural gene. Among transcriptional 
initiation regions used for plants are such regions 
associated with cauliflower mosaic viruses (35S, 19S) , and 
structural genes such as for nopaline synthase or mannopine 
synthase or napin and ACP promoters, etc. The 
transcription/ translation initiation regions corresponding 
to such structural genes are found immediately 5' upstream 
to the respective start codons . Thus, depending upon the 
intended use, different promoters may be desired. 

Of special interest in this invention are the use of 
promoters which are capable of preferentially expressing the 
synthase in seed tissue, in particular, at early stages of 
seed oil formation. Examples of such seed-specific promoters 
include the region immediately 5' upstream of a napin or 
seed ACP genes such as described in USPN 5,420,034, 
desaturase genes such as described in Thompson et al (Proc. 
Nat. Acad. Sci . (1991) 88 : 2578-2582 ) , or a Bce-4 gene such 
as described in USPN 5,53 0,194. Alternatively, the use of 
the 5' regulatory region associated with the plant synthase 
structural gene, i.e., the region immediately 5' upstream to 
a plant synthase structural gene and/or the transcription 
termination regions found immediately 3 ' downstream to the 
plant synthase structural gene, may often be desired. In 
general, promoters will be selected based upon their 
expression profile which may change given the particular 
application. 
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In addition, one may choose to provide for the 
transcription or transcription and translation of one or 
more other sequences of interest in concert with the 
expression or anti-sense of the synthase sequence, 
5 particularly medium-chain plant thioesterases such as 

described in USPN 5,512,482, to affect alterations in the 
amounts and/or composition of plant oils. 

When one wishes to provide a plant transformed for the 
combined effect of more than one nucleic acid sequence of 

10 interest, a separate nucleic acid construct may be provided 
for each or the constructs may both be present on the same 
plant transformation construct. The constructs may be 
introduced into the host cells by the same or different 
methods, including the introduction of such a trait by 

15 crossing transgenic plants via traditional plant breeding 

methods, so long as the resulting product is a plant having 
both characteristics integrated into its genome. 

Normally, included with the DNA construct will be a 
structural gene having the necessary regulatory regions for 

20 expression in a host and providing for selection of 

transformed cells. The gene may provide for resistance to a 
cytotoxic agent, e.g. antibiotic, heavy metal, toxin, etc., 
complementation providing prototrophy to an auxotrophic 
host, viral immunity or the like. Depending upon the number 

25 of different host species into which the expression 

construct or components thereof are introduced, one or more 
markers may be employed, where different conditions for 
selection are used for the different hosts. 
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The manner in which the DNA construct is introduced 
'into the plant host is not critical to this invention. Any 
method which provides for efficient transformation may be 
employed. Various methods for plant cell transformation 
include the use of Ti- or Ri-plasmids , microinjection, 
electroporation, liposome fusion, DNA bombardment or the 
like. In many instances, it will be desirable to have the 
construct bordered on one or both sides by T-DNA, 
particularly having the left and right borders, more 
particularly the right border. This is particularly useful 
when the construct uses A. tumefaciens or A. rhizogenes as a 
mode for transformation, although the T-DNA borders may find 
use with other modes of transformation. 

, The expression constructs may be employed with a wide 
variety of plant life, particularly plant life involved in 
the production of vegetable oils. These plants include, but 
are not limited to rapeseed, peanut, sunflower, safflower, 
cotton, soybean, corn and oilseed palm. 

For transformation of plant cells using . Agrobacterium, 
explants may be combined and incubated with the transformed 
Agrobacterium for sufficient time for transformation, the 
bacteria killed, and the plant cells cultured in an 
appropriate selective medium. Once callus forms, shoot 
formation can be encouraged by employing the appropriate 
plant hormones in accordance with known methods and the 
shoots transferred to rooting medium for regeneration of 
plants. The plants may then be grown to seed and the seed 
used to establish repetitive generations and for isolation 
of vegetable oils. 
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The invention now being generally described, it will be 
more readily understood by reference to the following 
examples which are included for purposes of illustration 
only and are not intended to limit the present invention. 

5 

EXAMPLES 

Example 1 Cuphea KAS Factor A and B Gene Cloning 

Total RNA isolated from developing seeds of Cuphea 

10 hooker iana and Cuphea pul lcherrima was used for cDNA 
synthesis in commercial 1 -based cloning vectors. For 
cloning each type of KAS gene, approximately 400,000-500,000 
unamplified recombinant phage were plated and the plaques 
transferred to nitrocellulose. For KAS. factor B cloning 

15 from C. hooker iaiha, a mixed probe containing Brassica napus 
KAS factor B and Ricinus communis (Castor) KAS factor B 
radiolabeled cDNA's was used. Similarly, a mixed probe 
containing Brassica napus KAS factor A and Ricinus communis 
KAS factor A cDNA clones was used to obtain C- hookeriana 

20 KAS factor A genes. For KASIII, a spinach KASIII cDNA 

clone obtained from Dr. Jan Jaworski was radiolabeled and 
used as a probe to isolate a KASIII clone from C. 
hookeriana. For KAS B and KAS A cloning from C. 
pullcherrima , C. hookeriana KAS B and KAS A genes chKAS B-2 

25 and chKAS A-2-7 (see below) were radiolabeled and used as 
probes . 

DNA sequence and translated amino acid sequence for 
Cuphea KAS clones are provided in Figures 1-9. Cuphea 
hookeriana KAS factor B clones chKAS B-2 and chKAS B-31-7 
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are provided in Figures 1 and 2. Neither of the clones is 
full length. Cuphea hooker iana KAS Factor A clones chKAS A- 
2-7 and chKAS A-l-6 are provided in Figures 3 and 4. chKAS 
A-2-7 contains the entire encoding sequence for the KAS 
factor protein. Based on comparison with other plant 
synthase proteins, the transit peptide is believed to be 
represented in the amino acids encoded by nucleotides 12 5- 
466. chKAS A-l-6 is not a full length clone although some 
transit peptide encoding sequence is present. Nucleotides 
1-180 represent transit peptide encoding sequence, and the 
mature protein encoding sequence is believed to begin at 
nucleotide 181 . 

Cuphea pullcherrima KAS factor B clones cpuKAS B/7-8 
and cpuKAS B/8-7A are provided in Figures 5 and 6. Both of 
the clones contain the entire encoding sequences for the KAS 
factor B proteins. The first 35 amino acids of cpuKAS B/7-8 
are believed to represent the transit peptide, with the 
mature protein encoding sequence beginning at nucleotide 
233. The first 39 amino acids of cpuKAS B/8-7A are believed 
to represent the transit peptide, with the mature protein 
encoding sequence beginning at nucleotide 209. Cuphea 
pullcherrima KAS factor A clones cpuKAS A/p7-6A and cpuKAS 
A-p8-9A are provided in Figures 7 and 8. Both of the clones 
contain the entire encoding sequences for the KAS factor A 
proteins. Translated amino acid sequence of cpuKAS A/p7-6A 
is provided. The mature protein is believed to begin at the 
lysine residue encoded 595-597, and the first 126 amino 
acids are believed to represent the transit peptide. The 
DNA sequence of KAS A clone cpuKAS A-p8-9A is preliminary. 



WO 98/46776 18 PCT/US98/07114 

Further analysis will be conducted to determine final DNA 
sequence and reveal the amino acid sequence encoded by this 
gene . 

DNA and translated amino acid sequence of Cuphea 
5 hookeriana KASIII clone chKASIII-27 is provided in Figure 9. 
The encoding sequence from nucleotides 37-144 of chKASIII-27 
• are believed to encode a transit peptide, and the presumed 
mature protein encoding sequence is from nucleotides 145- 
1233 . 

10 Deduced amino acid sequence of the C. hookeriana KAS 

factor B and KAS factor A cDNA's reveals strong homology to 
the Brassica napus and Ricinus communis clones previously 
reported. The C. hookeriana KAS factor B clone is more 
homologous to the Ricinus and Brassica KAS factor B clones 

15 (94% and 91% respectively) than it is to the Ricinus and 
Brassica KAS factor A clones (60% for both) . Furthermore, 
the C. hookeriana KAS factor A clone is more homologous to 
the Ricinus and Brassica KAS factor A clones (85% and 82% 
respectively) than it is the Ricinus and Brassica KAS factor 

20 B clone (60% for both) . The C. hookeriana KAS factor B 
cDNAs designated as chKAS B-2 and chKAS B-31-7 are 96% 
identical within the mature portion of the polypeptide. 
Similarly, the deduced amino acid sequence of the mature 
protein regions of the C. hookeriana KAS factor A clones 

25 chKAS A-2-7 and chKAS A-l-6 are 96% identical. The C. 

pullcherrima KAS clones also demonstrate homology to the R. 
communis and Brassica napus KAS clones. The mature protein 
portion of all of the KAS factor A family members in the 
different Cuphea species are 95% identical. Similarly the 
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mature protein portion of the KAS factor B genes in Cuphea 
' are also 95-97% identical with each other. However there is 
only approximately 60% sequence identity between KAS factor 
B and KAS factor A clones either within the same or 

5 different species of Cuphea. 

I 
i 

\ Example 2 Levels and Patterns of Expression 

To examine tissue specificity of KAS expression in 
Cuphea hookeriana, Northern blot analysis was conducted 
10 using total RNA isolated from seed, root, leaf and flower 
tissue. Two separate but identical blots were hybridized 
with either chKAS B-31-7 or chKAS A-2-7 coding region 
probes. The data from this RNA blot analysis indicate that 
KAS . B is expressed at a similar level in all tissues 

15 examined, whereas KAS A expression is detected only in the 
seed. These results also demonstrate a different level of 
expression for each of the synthases. KAS A is an abundant 

I message, whereas KAS B is expressed at low levels. 

Furthermore, even under highly stringent hybridization 

20 conditions (65_C, 0.1 X SSC, 0.5% SDS) , the KAS A probe 

hybridizes equally well with two seed transcripts of 2.3 and 
1.9 kb. The larger hybridizing band is likely the 
transcript of the KAS A-2-7 gene since the size of its cDNA 
is 2 046bp, and the number of clones obtained from cDNA 

25 screening corresponds well with the apparent mobility of the 
mRNA and its abundance on the blot. 
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Example 3 Expression of Plant KAS Genes in E.coli 

DNA fragments encoding the mature polypeptide of the 
Cuphea hookeriana KAS A cDNAs and the Cuphea pullcherrima 
5 KAS B cDNAs were obtained by PCR and cloned into a 

QIAexpress expression vector (Qiagene) . Experimental 
conditions for maximum level of expression were determined 
for all of these clones and the parameters for highest level 
of soluble fraction were identified. Cells are grown in 

10 ECLB media containing 1M sorbitol and 2.5 mM betaine 

overnight and subcultured as a 1:4 dilution in the same 
medium. Cells are then grown for 2 hours (to approximately 
.6-. 8 O.D.) and induced with 0 . 4 mM IPTG and allowed to grow 
for 5 more hours . 

15 Enzyme activity of the affinity purified recombinant 

enzymes obtained from over-expression of the chKAS A-2-7 and 
cpuKAS B/8-7A clones was measured using a wide range of 
acyl-ACP substrates (6:0- to 16:1-ACP). The activity 
profile for cpuKAS B/8-7A is provided in Fig. 10. The data 

20 demonstrate that the enzyme is active with all acyl-ACP 

substrates examined, although activity on 6:0 to 14:0-ACP 
substrates is substantially greater than the activity on 
16:0 and 16:1 substrates. 

The activity profile of the C. hookeriana KAS A clones 

25 chKAS A-2-7 and chKAS A-l-6 is provided in Figure 11. The C. 
hookeriana KAS A clones are most active with C:6, and have 
the least activity with C:16:0 substrates. However, the 
activity of this clone on even the preferred C6 : 0 substrate 
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is 50 fold lower than the activity of the C. pullcherrima 
KAS B clones. 

A fragment containing the mature protein encoding 
portion of a R. communis KAS factor A clone was also cloned 
5 into a QIAexpress expression vector, expressed in E. coli 
and the enzyme affinity purified as described above. The 
activity profile for castor KAS A is provided in Figure 12. 
Highest activity is observed with C14:0 substrates, although 
some activity is also seen with C6:0 and C16:l. In 

10 comparison, the activity profile obtained from purified R. 
communis KAS factor B also using the QIAexpress expression 
system is provided in Figure 13 . The KAS B clone 
demonstrates substantially higher levels of activity (10 
fold and higher) than the R . communis KAS A clone. The 

15 preference of the KAS factor B for 6:0- to 14:0-ACP 

substrates is consistent with the previous observations that 
this protein provides KAS I activity. 

Example 4 KAS and TE Expression in Transgenic Seed 

20 Both the CpFatBl (C. hookeriana thioesterase cDNA; 

Dehesh et al . (1996) Plant Physiol. 110:203-210) and the 
chKAS A-2-7 were PCR amplified, sequenced, and cloned into a 
napin expression cassette. The napin/cp FatBl and the 
napin/KAS A-2-7 fusions were ligated separately into the 

25 binary vector pCGN1558 (McBride and Summerfelt (Pi .Mol . Biol . 
(1990) 14:269-276) and transformed into A. tumefaciens, 
EHA101. The resulting CpFatBl binary construct is pCGN5400 
and the chKAS A-2-7 construct is pCGN5401. Agrohac terium 
mediated transformation of a Brassica napus canola variety 
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was carried out as described by Radke et al . (Theor. Appl . 
Genet. (1988) 75:685-694; Plant Cell Reports (1992) 
11:499-505). Several transgenic events were produced for 
each of the pCGN5400 and pCGN5401 constructs. 

A double gene construct containing a napin/cpFatBl 
expression construct in combination with a napin/chKAS A-2-7 
expression construct was also assembled, ligated into a 
binary vector and used for co-cultivation of a canola 
Brassica variety. The binary construct containing the 
chFatBl and chKAS A-2-7 expression constructs is pCGN5413 . 

Fatty acid analysis of 26 transgenic lines containing 
chKAS A-2-7 (5401 lines) showed no significant changes in 
the oil content or profile as compared to similar analyses 
of wild type canola seeds of the transformed variety. 

Fatty acid analysis of 3 6 transgenic lines containing 
cpFatBl (5400 lines) showed increased levels of C:8 and C:10 
in transgenic seeds. The highest level of C:8 observed in a 
pool seed sample was 4.2 mol% . The C:10 levels were between 
30 and 35% of the C:8 content. Fatty acid analysis of 25 
transgenic lines containing the TE/KAS A tandem (5413 lines) 
demonstrated an overall increase in both C:8 and C:10 levels 
relative to those observed with TE containing lines (5400) 
alone. In lines containing the cpFatBl construct alone, the 
average level of C:8 average were 1.5 mol%, whereas the C:8 
average levels in TE/KAS A tandem containing lines was 2.37 
mol%. The ratio of C:8 to C:10 remained constant in both 
populations. The number of transgenic events relative to 
the C:8 content are presented in Figure 14. These data show 
that the transgenic events with tandem TE/KAS A construct 
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yield more lines with higher levels of C:8 than those events 
with single TE construct. For example, several lines 
containing nearly 7 mole% C8 were obtained with the .TE/KAS A 
PCGN5413 construct, whereas the highest C8 containing line 
5 from the pCGN5400 TE alone transformation contained 4.2 
mole% C8. 

Half seed analysis of the T3 generation of transgenic 
canola plants expressing a ChFatB2 (C. hookeriana 
thioesterase; Dehesh et a 1 . (1996) The Plant Journal 3:167- 

10 172) indicate that these plant can accumulate up to 22 

weight% (33 mol%) of 8:0 and 10:0 fatty acids (4804-22-357). 
Segregation analysis shows that these transf ormants contain 
two loci and that they are now homozygous. Selected plants 
grown from these half seeds were transferred into the 

15 greenhouse and later crossed with Tl transf ormants that had 
been transformed with either Cuphea hookeriana KAS A (5401) 
alone or KAS A/CpFatBl double constructs (5413) . 

Fatty acid analysis of several events resulting from 
the crosses between transgenic lines containing ChFatB2 

20 (4804-22-357) and chKAS A-2-7 (5401-9), reveal an increase 
in the ratio of C:10/C:8 levels (Figure 15). This C:10/C:8 
ratio in nearly all of the transgenic events containing 
ChFatB2 TE alone fluctuates between 3 and 6, whereas in the 
Fl generation of transgenic containing both the TE and the 

25 KAS A-2-7, the ratio can be as high as 22. This increase in 
C:10 levels is accompanied by an increase in the total C:8 
and C:10 content (Figure 16). The sum of the C:8 and C:10 
fatty acids in the heterozygous Fl lines is as high as those 
in the homozygous parent line (4804-22-357) , whereas the 
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heterozygous lines usually contain substantially less C:8 
and C:10 than the homozygous lines. 

Similar results were observed in Fl generation seeds 
resulting from crosses performed between 4804-22-357 
5 (ChFatB2) and the 5413-17 event (CpFatBl and chKAS A-2-7 
tandem). Levels of C:8 and C:10 in the 5413-17 line were 
6.3 and 2.8 mol% respectively. Data presented in Figure 17 
show that there is shift towards C:10 fatty acids as was 
observed with the 4804-22-357 (ChFatB2) x 5401-9 (chKAS A-2- 

10 7) crosses. Furthermore, Figure 18 indicates the presence 
of two separate populations of heterozygotes . Those 
containing approximately 9-11 weight percent C:10 + C:8 are 
believed to represent offspring containing a single copy of 
the ChFatBl TE gene and no copies of the CpFatBl and chKAS A 

15 genes from 5413. Those plants containing approximately 15- 
20 weight percent C:10 + C:8 are believed to represent the 
heterozygotes containing a single ChFatBl TE gene as well as 
the CpFatBl and chKAS A genes from 5413. Thus, the level of 
the C:10 + C:8 fatty acids does not decrease to 50% of that 

20 detected in parent lines when a copy of the ChKAS A gene is 
present . 

To further characterize the chain length specificity of 
the Cuphea hookerlana KAS A enzyme, crosses between 
transgenic Brassica napus lines containing a California Bay 
25 (Umhellularia calif ornica) 12:0 specific thioesterase , Uc 
FatBl (USPN 5,344,771) and chKAS A-2-7 (5401-9) were made. 
Half seed analysis of transgenic plants containing Uc fatBl 
have previuosly indicated that these plants can accumulate 
up to 52 mol% C12:0 in the seed oil of homozygous dihaploid 
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lines (LA86DH186) . Crosses between the line LA86DH186 and 
untrans formed control Brassica demonstrated a decrease in 
the C12:0 levels. 

However, crosses between LA86DH186 and the 5401-9 
hemizygous line led to an accumulation of up to 57 mol% 
C12:0 in the seed oil of Fl progeny (Figure 19). 
Interestingly, in crosses with LA86DH186 x untransf ormed 
control line and LA86DH186 x 5401-9, levels of C14 : 0 in the 
seeds of the Fl progeny decreased to 50% of the levels 
obtained in homozygous LA86DH186 lines (Figure 20). 
Furthermore, increases in the proportion of C12:0 fatty acid 
resulted in a substantial decline in the proportions of all 
the long-chain. fatty acyl groups (C16:0, C18:0, C18:2, and 
C18:3). These results indicate that the ChKAS A-2-7 is an 
enzyme with substrate specificity ranging from C6 : 0 to 
C10:0-ACP, and that its over-expression ultimately reduces 
the longer chain acyl-ACP pools.. 

Further evidence is obtained in support of the chain 
length specificity of the ChKAS A-2-7 in crosses of the 
5401-9 line with a transgenic line (5266) expressing an 
18:1/18:0 TE from Garcinla mangostana (GarmFatAl, US patent 
application No. 08/440,845). Transgenic Brassica line 5266 
has been shown to accumulate up to 24 mol% C18:0 in the seed 
oil of homozygous lines (Figure 21) . However, in the seed 
oil of Fl progeny of crosses between 5266 and 5401-9 levels 
of C18:0 were reduced to approximately 12 mol%. 
Furthermore, levels of C16:0 generated from these crosses 
was similar to the levels obtained from the seed oil of 
nontransgenic control plants. 
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Example 5 In vitro Analysis of Plant KAS Enzymes 

Seed extracts were prepared from developing seeds of 
nontransgenic controls or transgenic Brassica expressing 
chKAS A-2-7 as described in Slabaugh et al. {Plant Journal, 
5 1998 in press) and Leonard et al . (Plant Journal, 1998, in 
press) . In vitro fatty acid synthesis assays were performed 
as described by Post-Beittenmiller (J. Biol. Chem. (1991), 
266:1858-1865). Extracts were concentrated by ammonium 
sulfate precipitation and desalting using P-6 columns (Bio- 

10 Rad, Hercules, CA) . Reactions (65p.l) contained 0 . 1M 

Tris/HCl (pH 8.0), 1 mM dithiothreitol , 25 mM recombinant 
spinach ACPI, 1 mM NADH, 2 mM NADPH, 50 |IM malonyl-CoA, 10 
MM [l-"C]acetyl-CoA (50 mCi/mmol), Img/ml BSA, and 0.25 
mg/ml seed protein. Selected seed extracts were 

15 preincubated with cerulenin at 23°C for 10 min . Reaction 
products were separated on an 18% acrlamide gel containing 
2.25M urea, electroblotted onto to nitrocellulose and 
quntitated by phosporimaging using Image QuaNT software 
(Molecular Dynamics, Sunnyvale, CA) . Authentic acyl-ACPs 

20 were run in parallel, immunoblotted and finally detected by 
anti-ACP serum to confirm fatty acid chain lengths. 

The results (Figure 22) indicate that the fatty acid 
synthesis capabilities of transgenic Brasica (5401-9) seed 
extracts was greater than that obtained from in the 

25 nontransgenic controls as measured by the relative abundance 
of C8:0- and C10:0-ACP at all time points tested. In 
addition, pretreatment of the extracts with cerulenin, 
markedly reduced the synthesis of longer chain fatty acids 
in both the transgenic and nontransgenic control seed 
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extracts. However, the extension of the spinach-ACP was much 
less inhibited in the seed extracts from the transgenic 
lines than in the seed extracts of nontransgenic control 
Brassica . 

5 These data further support that Ch KAS A-2-7 is a 

condensing enzyme active on medium chain acyl-ACPs, and that 
expression of this enzyme in plants results in enlarged 
substrate pools to be .hydrolyzed by medium-chain specific 
thioesterases. Furthermore, these data suggest that chKAS 
10 A-2-7 also is a cerulenin-resis tant condensing enzyme. 

All publications and patent applications mentioned in 
this specification are indicative of the level of skill of 
those skilled in the art to which this invention pertains. 
15 All publications and patent applications are herein 

incorporated by reference to the same extent as if each 
individual publication or patent application was 
specifically and individually indicated to be incorporated 
by reference. 

20 Although the foregoing invention has been described in 

some detail by way of illustration and example for purposes 
of clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the scope 
of the appended claim. 
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13. The construct of Claim 5 wherein said encoding 
sequence is cpuKAS A/p8-9A. 

14. The construct of Claim 5 wherein said encoding 
sequence is chKASIII-27 . 

5 15. An improved method for producing medium-chain fatty 

acids in transgenic plant seeds by expression of a plant 
medium-chain thioesterase protein heterologous to said 
transgenic plant, 

the improvement comprising expression of a plant synthase 
10 factor protein heterologous' to said transgenic plant in 
conjunction with expression of said plant medium-chain 
thioesterase, whereby the percentage of medium-chain fatty 
acids produced in seeds expressing both a plant synthase factor 
protein and a plant medium-chain thioesterase protein is 
15 increased as compared to the percentage of medium-chain fatty 

acids produced in seeds expressing only said plant medium-chain 
thioesterase protein. 

16. The method of Claim 15 wherein said medium-chain 
thioesterase protein is a ChFatB2 protein. 
20 17. The method of Claim 15 wherein said medium-chain 

thioesterase protein is a CpFatBl protein. 

18. The method of Claim 15 wherein said medium-chain 
thioesterase protein is a C12 preferring thioesterase from 
California bay. 

25 19. The method of Claim 15 wherein said plant synthase 

factor protein is expressed from a construct according to Claim 
1. 

20. The method of Claim 19 wherein said synthase factor A 
protein is from a Cuphea species. 
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21. The method of Claim 20 wherein said Cuphea species is 
C. hookeriana or C. pullcherrima . 

22. A method of altering the medium-chain fatty acid 
composition in plant seeds expressing a heterologous plant 
medium-chain preferring thioesterase, wherein said method 
comprises 

providing for expression of a plant synthase factor 
protein heterologous to said transgenic plant in conjunction 
with expression of a plant medium-chain thioesterase protein 
heterologous to said transgenic plant, whereby the composition 
of medium-chain fatty acids produced in said seeds is modified 
as compared to the composition of medium-chain fatty acids 
produced in seeds expressing said plant medium-chain 
thioesterase protein in the absence of expression of said plant 
synthase factor protein. 

23. The method of Claim 22 wherein said medium-chain 
thioesterase protein is a ChFatB2 protein. 

24. The method of Claim 22 wherein said medium-chain 
thioesterase protein is a CpFatBl protein. 

25. The method of Claim 22 wherein said medium-chain 
thioesterase protein is a C12 preferring thioesterase from 
California bay. 

26. The method of Claim 22 wherein said plant synthase 
factor protein is expressed from a construct according to Claim 
1. 

27. The method of Claim 26 wherein said synthase factor A 
protein is from a Cuphea species . 

28. The method of Claim 27 wherein said Cuphea species is 
C. hookeriana or C. pullcherrima. 
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29. The method of Claim 22 wherein said fatty acid 
composition is enriched for CIO fatty acids. 

30. The method of Claim 22 wherein said fatty acid 
composition is enriched for C12 fatty acids. 

31. The method of Claim 22 wherein said fatty acid 
composition is enriched for at least one medium chain fatty 
acid and at least one other medium chain fatty acid is 
decreased. 

32. The method of Claim 31 wherein said enriched fatty 
acid is C12 and said decreased fatty acid is C14 . 
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